On Coding Non-Contiguous Letter Combinations
Abstract
Starting from the hypothesis that printed word identification initially involves the parallel mapping of visual features onto location-specific letter identities, we analyze the type of information that would be involved in optimally mapping this location-specific orthographic code onto a location-invariant lexical code. We assume that some intermediate level of coding exists between individual letters and whole words, and that this level represents letter combinations. We then investigate the nature of this intermediate level of coding given the constraints of optimality. This intermediate level of coding is expected to compress data while retaining as much information as possible about word identity. The information conveyed by letters is a function of how much they constrain word identity and how visible they are. Optimizing this coding means both minimizing resources (using the most compact representations) and maximizing information. We show that in a large proportion of cases, non-contiguous letter sequences contain more information than contiguous sequences, while at the same time requiring less precise coding. Moreover, we find that the best predictor of human performance in orthographic priming experiments is the within-word ranking of conditional probabilities, rather than average conditional probabilities. We conclude that, from an optimality perspective, readers learn to select certain contiguous and non-contiguous letter combinations as the information that provides the best cue to word identity.
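The idea of ranking letter combinations by how strongly they constrain word identity can be illustrated with a minimal sketch. It is not the authors' procedure: the three-word lexicon, the equal frequencies, and the function names `pair_cues`, `conditional_probabilities`, and `rank_within_word` are all assumptions made for illustration, and letter visibility, which the abstract names as a second factor, is ignored. Each cue is an ordered letter pair tagged as contiguous or non-contiguous, and the conditional probability of a word given a cue is taken to be the word's frequency divided by the summed frequency of all words containing that cue.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical toy lexicon (word -> frequency); not data from the paper.
TOY_LEXICON = {"salt": 1.0, "malt": 1.0, "sale": 1.0}


def pair_cues(word):
    """Return every ordered letter pair in `word` as (pair, is_contiguous)."""
    cues = set()
    for i, j in combinations(range(len(word)), 2):
        cues.add((word[i] + word[j], j == i + 1))
    return cues


def conditional_probabilities(lexicon):
    """Estimate P(word | cue) for each word and each cue it contains.

    Assumption: P(word | cue) = freq(word) / total freq of words with that cue.
    """
    mass = defaultdict(float)
    for word, freq in lexicon.items():
        for cue in pair_cues(word):
            mass[cue] += freq
    return {
        word: {cue: freq / mass[cue] for cue in pair_cues(word)}
        for word, freq in lexicon.items()
    }


def rank_within_word(cue_probs):
    """Sort one word's cues from most to least informative (ties by cue)."""
    return sorted(cue_probs.items(), key=lambda item: (-item[1], item[0]))


PROBS = conditional_probabilities(TOY_LEXICON)
SALT_RANKING = rank_within_word(PROBS["salt"])
```

In this toy lexicon the top-ranked cue for "salt" is the non-contiguous pair s_t, because no other word contains an s followed later by a t, whereas the contiguous pair "al" is shared by all three words and ranks last. The within-word ranking, rather than the average of these values, is the kind of quantity the abstract reports as the better predictor of priming performance.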
Similar resources
Evidence for Letter-Specific Position Coding Mechanisms
The perceptual matching (same-different judgment) paradigm was used to investigate precision in position coding for strings of letters, digits, and symbols. Reference and target stimuli were 6 characters long and could be identical or differ either by transposing two characters or substituting two characters. The distance separating the two characters was manipulated such that they could either...
A Dual-Route Approach to Orthographic Processing
In the present theoretical note we examine how different learning constraints, thought to be involved in optimizing the mapping of print to meaning during reading acquisition, might shape the nature of the orthographic code involved in skilled reading. On the one hand, optimization is hypothesized to involve selecting combinations of letters that are the most informative with respect to word id...
Deep generative learning of location-invariant visual word recognition
It is widely believed that orthographic processing implies an approximate, flexible coding of letter position, as shown by relative-position and transposition priming effects in visual word recognition. These findings have inspired alternative proposals about the representation of letter position, ranging from noisy coding across the ordinal positions to relative position coding based on open b...
The impact of letter spacing on reading: a test of the bigram coding hypothesis.
Identifying letters and their relative positions is the basis of reading in literate adults. The Local Combinations Detector model hypothesizes that this ability results from the general organization of the visual system, whereby object encoding proceeds through a hierarchy of neural detectors that, in the case of reading, would be tuned to letters, bigrams, or other letter combinations. Given ...
Comment on "Linguistic features of noncoding DNA sequences"
In a recent letter [1], Mantegna et al. report that certain statistical signatures of natural language can be found in non-coding DNA sequences. The vast majority of DNA in higher organisms including humans consists of non-coding sequences whose function, if any, is unknown. Hence this new analysis is quite important. It suggests, as the authors concluded, "the possible existence of one (or...